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Abstract 

Consider networks on n vertices at average density 1 per unit area. We 
seek a network that minimizes total length subject to some constraint on 
journey times, averaged over source-destination pairs. Suppose journey 
times depend on both route-length and number of hops. Then for the 
constraint corresponding to an average of 3 hops, the length of the optimal 
network scales as n 13 / 10 . Alternatively, constraining the average number 
of hops to be 2 forces the network length to grow slightly faster than order 
n 3//2 . Finally, if we require the network length to be O(n) then the mean 
number of hops grows as order log log n. Each result is an upper bound in 
the worst case (of vertex positions), and a lower bound under randomness 
or equidistribution assumptions. The upper bounds arise in simple hub 
and spoke models, which are therefore optimal in an order of magnitude 
sense. 
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1 Introduction 

To design a network linking given points (envisage cities) in the plane, one 
might specify a cost functional and a benefit functional on all possible networks, 
and then consider the spectrum of networks which are optimal in the sense 
of minimizing cost for a given level of benefit. This suggests a mathematics 
program: 

for mathematically simple cost/benefit functionals, study the prop- 
erties (geometry, cost and benefit values) of such optimal networks 
as the number n of points tends to infinity, under either worst-case 
or typical case point positions. 

Network design problems arise in many applied fields, but real-world modelling 
leads to more complicated functionals than we have in mind [12]. Algorithmic 
issues in network optimization are another well-studied topic [5] , and the flavor 
of ongoing research in one academic Engineering community can be seen by 
browsing the journal Transportation Research. But the combination of n — > 
oo asymptotics and simple functionals has apparently not been systematically 
studied, though some recent statistical physics literature touches upon similar 
ideas (in particular [7], discussed in section [L3|) . 

Write x = x" = {xi, X2, ■ ■ ■ , x n } for a configuration of n vertices in the 
square [CI,™ 1 / 2 ] 2 of area n (we find this "density 1" scaling convention more 
natural than others). So Xi is the position of vertex i. Create a connected net- 
work G n by adding edges: edges are line-segments with their natural Euclidean 
lengths, and edges may meet at junctions not in the given vertex-set x. Define 
the network length len(G„) as the sum of the edge- lengths. This is perhaps 
the simplest choice of functional to serve as "cost". The kind of "benefit" we 
have in mind is that the network enables material to be transported between 
typical vertices i,j in short time. The simplest formalization of "time" is as 
route-length, and the trade-off in this setting is studied in [2] . In this paper we 
envisage a type of setting exemplifed by passenger airline travel where direct 
flights are not always available, and by package shipments via commercial car- 
rier. In such settings, the time to transfer between successive stages ("hops") 
of the journey is of the same order of magnitude as the transit times. As the 
simplest model for this phenomenon, fix a parameter A > 0. For any route n 
consisting of successive edges ei, e 2 , . . . , e m define the travel time for this route 
to be 



Here len(ej) denotes the length of edge ei. The first term represents time to 
transfer between successive hops, and the second term represents transit time. 
Note the scaling is chosen to make both terms have the same order of magnitude, 
as n — > oo, assuming the number of hops stays bounded. Now define the journey 
time between any pair of vertices to be 




time(i, j) 



min{timc(7r) : 7r a route from i to j} 
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and then for the network G n define the average journey time statistic 



time(G n ) = a.veu ! j\tims(i,j). 



Adding extra edges to a network will decrease time(G„) at the expense of in- 
creasing len(G„). In this paper we study the trade-off between len(G„) and 
time(G„) as n — > oo. 

Recall the concept of a hub and spoke network [llj , which became familiar 
in the U.S. in the 1970s with the FedEx (overnight package delivery) hub at 
Memphis, and in the 1980s with Delta Airlines establishing hubs in Atlanta, 
New York, Cincinnati and Salt Lake City. In such networks, a typical vertex 
is linked to only one or two relatively nearby hubs, while the hubs are well- 
connected to each other. Intuition (or the fact they form the basis of multibillion 
dollar industries) suggests that such networks should be roughly optimal with 
respect to the trade-off between length and time as defined above. We will prove 
asymptotic results in this spirit. Essentially, our results say that simple hub- 
and-spoke networks can achieve lengths which are the same order of magnitude 
as the optimal network. Understanding properties of the exact optimal networks 
seems a much harder problem, which we do not tackle. 

1.1 Some mathematical set-up 

In studying this topic we found it helpful to first consider the random model 
the n vertices of X™ are random (uniform, independent) in the square [0, n 1 / 2 ] 2 



and then consider the "worst case" , i.e. seek upper bounds for an arbitrary con- 
figuration x", and finally consider lower bounds, which inevitably require some 
constraints making the configuration x" be fully two-dimensional. It turns out 
that the order of magnitude for the random model is determined by the general 
upper and lower bounds, so we haven't recorded results for the random model 
explicitly except in Theorem [1] where a natural "sharp optimality" conjecture 
arises. 

For lower bounds we will use a quantitative version of an equidistribution 
property. Take integers L' n and consider the natural partition of the square 
[0, n 1 / 2 ] 2 into subsquares a of side s n = n 1 / 2 /L' n . For a configuration x" , define 
the "smoothed empirical distribution" tp as the distribution of a position Y 
obtained by picking a uniform random point x from x" and then taking Y 
uniform on the subsquare containing x. Write D Sri (x n ) for variation distance 
between ip and the uniform distribution on [Ojfi 1 / 2 ] 2 . Explicitly, 



where the sum is over the subsquares a and where |x™ n a\ is the number of 
points of x™ that lie in the subsquare ex. See section 12.11 for remarks on this 
definition. 



(1) 




a 
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Define 



C\ = mean distance between two random points in the unit square ~ 0.52. 

Write d(-, •) for Euclidean distance, and write 

dist(x) = nT 1 ' 2 aveu j\d(xi, Xj) 

for normalized average inter-vertex distance. For any network G n on vertices 
x™ we have 

time(G„) > dist(x") 

and so in comparing networks on different vertex-sets it is natural to consider 
the difference time(G n ) — dist(x"). It is clear that in the random model, the 
expectation satisfies 

E dist(X") -» Cl . (2) 

Finally, define the hop- diameter of a network to be the maximum (over i 
and j) of the minimum number of hops in a route from i to j. 

1.2 Statement of results 

We start with what turns out to be the most interesting point on the spectrum, 
where the constraint on "time" corresponds to an average of three steps in a 
route, and we seek to minimize network length subject to that constraint. Let 
us first consider a specific network. Write T for the vertex-set of the triangu- 
lar lattice whose Voronoi tessallation partitions the plane into hexagonal cells; 
standardize so that the density of vertices (and hence the area of hexagons) 
equals 1. It is classical [5] that this is the "best" tessallation in terms of mean 
distance from a random point to the closest lattice vertex: see [TU] for a recent 
treatment and history, and see [9] for a broad account of optimality properties 
of the triangular lattice. Define numerical constants 

C2 = mean distance from center of unit-area hexagon to random point in hexagon w 0.377 

C3 = distance from center of unit-area hexagon to corner of hexagon boundary = 

The 3-hop hexagonal network. Write T(L,n) for the vertices of the tri- 
angular lattice after scaling by L, so that cells have area L 2 , and then restrict- 
ing to vertices whose cells intersect the square [0,n 1/ ' 2 ] 2 . Define the network 
G 3 ~ hcx (x n ,L) to have edges 

(i) between each pair of vertices of T(L, n); 

(ii) from each vertex Xi to the closest vertex of T(L, n). 

So this is a "hub and spoke" network where the hubs are the vertices of T(L, n); 
obviously it has hop-diameter 3. 

Theorem 1 (a) In the random model, choosing 

L n = (2c 1 /c 2 ) 1 /5 n 3/io 
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we have 

^3 — hex 



E time(G (X n ,L„)) ™> 2A + Cl . (3) 
E len(G 3 ~ hcx (X", £„)) ~ /3n 13/1 °; /3 := 5 ■ 2- 9/r °c\ /b c 2 /b . (4) 
(7>J For arbitrary (x ra ), wii/i i/ie same choice of L n 

time(G 3 - hcx (x",L n )) - dist(x") < 2A + o(l). (5) 
len(G 3 - hcx (x",L„)) = 0(n 13 / 10 ). (6) 

(c^ Suppose (x™) satisfies the equidistribution property 

D Sn -> /or some s„ = 0(n 3/10 ). 
TTien cmy networks G n on x" satisfying 

time(G„) - dist(x") < 2A + o(l) (7) 

must satisfy 

len(G„) = fi(n 13 / 10 ), 
and this conclusion remains true if is replaced by the weaker assumption 

time(G„) - dist(x") < (3 - e)A + o(l) (8) 

/or /ixed e > 0. 

(Recall that a n = 0(6 n ) means b n = 0(a n )). In words, this result says that for 
networks with three-hop routes (i.e. an average of three hops), this particular 
hub and spoke model has length of order n 13 / 10 in both the random setting and 
the worst-case setting; and no other network can improve this order of magni- 
tude (assuming the equidistribution property) without increasing the average 
number of hops to almost 4. The unusual scaling exponent 13/10 makes the 
result memorable. The proof (section 12-21) uses nothing more than freshman 
calculus for (a) and (b), and little more than the coupling interpretation of 
variation distance for (c). We conjecture that the constant in (H|) is optimal: 

Conjecture 2 In the random model, any networks (G n ) satisfying 

E timc(G„) < 2A + ci + o(l) 

must satisfy 

Elen(G„) > (/3-o(l))n 13 / 10 

for j3 defined at Q). 

This may be fussy to prove, because it obviously involves the optimality property 
of the triangular lattice. Incidently, the only reason we use the triangular lattice 
at all is to formulate this conjecture; for order-of-magnitude results the square 
lattice would work just as well. 

Moving along the trade-off spectrum, the next result says that, to decrease 
the average number of hops to 3 — e, the length of the network must increase 
from order n 13 / 10 to order n 3 / 2 ; and with this length we can decrease the average 
number of hops to 2 + e. 



5 



Proposition 3 For 1 > e > there exist constants < b(A,e) and B(A,e) < 
oo satisfying the following. 

(a) For arbitrary (x n ) there exist networks (G n ) such that 

time(G„) - dist(x n ) < (l + e)A + o(l) (9) 
len(G„) < B(A,e)n 3 ^ 2 (10) 

and we may choose such networks to have hop- diameter 2. 

(b) Suppose (x n ) satisfies the equidistribution property 

D Sn — > for some s n = o(n 1 ^ 2 ). 
Then any networks G n on x" satisfying 

time(G„) - dist(x") < (2 - e)A + o(l) (11) 

must satisfy 

len(G„) > (6(A,e)-o(l))n 3 / 2 . 

The proof (section [2.3[1 is just a simpler analog of the proof of Theorem [TJ The 
construction in (a) is again a hub-and-spoke network, but now each point is 
linked to each hub. 

The situation at "2 hops" is somewhat different from the situation at "3 
hops" , as the next result shows: the optimal length grows just faster than order 
7i 3 / 2 . See section 12.41 for the proof. 

Proposition 4 (a) For arbitrary (x™) and for w n — > oo arbitrarily slowly, there 
exist networks (G n ) such that 

time(G„) - dist(x") < A + o(l) (12) 
len(G„) < Lo n n 3 ' 2 (13) 

and we may choose such networks to have hop- diameter 2. 
(b) Suppose (x™) satisfies the equidistribution property 

D Sn — * for some s n — o(n 1 ^ 2 ). 

Then any networks G n on x" satisfying 

time(G„) - dist(x") < A + o(l) (14) 

must satisfy 

n- 3/2 len(G„) oo. 

It is easy to guess intuitively how Theorem [T] and Proposition [4] should extend 
to the "average m-hops" setting for general m > 4. We outline this in section 
13.21 without details. 

For the final result we look at the trade-off the other way round. Constrain- 
ing network length to be 0(n), how small can wc make the average number of 
hops? Precisely, for vertices i,j in a network G n define hop(«,j) as the smallest 
number of hops in a route from i to j, then define hop(G n ) = ave(j j\hop(i, j) < 
hop — diameter(G„). 
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Theorem 5 (a) For arbitrary (x n ) there exist networks (G n ) such that len(G„) = 
0(n) and hop(G„) < hop — diameter(G„) < (2 + o(l)) log 2 logn. 
(b) Suppose (x" ) satisfies the equidistribution property 

Z) Sn (x n ) — > /or some s„ = o(exp(log^ n)) 

/or some /3 > 0. Then any networks G n on x" satisfying len(G„) = O(n) must 
satisfy hop(G„) > (1 — (3 — o(l)) log 2 logn. 

Conjecture 6 The lower bound in (b) can be improved to 2(1— j3— o(l)) log 2 logn. 

The construction for (a) uses a hierarchical network. See section 12.51 for the 
proof and a remark relating to the conjecture. 

1.3 Discussion 

There is a quite extensive literature on heuristic algorithms for designing optimal 
hub and spoke type networks: see the survey [I]. For a typical relevant paper 
in the discipline of transportation research (the optimal network for an airline 
in Western Europe) see pQ . The general phenomenon expressed by our results - 
that there is a big difference between average 3 hop and average 2 hop networks 
- has of course been noted in this literature (e.g. [13]), being very visible in 
any moderately large example, but we haven't found any previous attempt at 
theoretical analysis of this difference. As well as the specific scaling exponents 
we have derived, our work serves to verify that natural hub and spoke networks 
are indeed optimal in an order of magnitude sense. 

Closest to our work is that of Gastner and Newman [7], who describe es- 
sentially the same model. Without giving theoretical analysis, they show (their 
Figure 5) pictures of optimal networks for the U.S. (discretized to 200 points 
and population- weighted) . Their figure illustrates the expected qualitative be- 
havior as A increases: fewer hubs and longer spokes. It would be interesting to 
quantify this behavior. Unfortunately, as illustrated schematically in Figure 1, 
our results are insensitive to the value of A. Of course this highlights a general 
point that n — > oo asymptotics may not capture the features of real- world rele- 
vance. Note that time(G ra ) involves the product of A and average hop number. 
Varying A while holding time(G n ) fixed, as in [7], is moving along a hyperbola 
in Figure 1, so one expects qualitative changes in the geometry of the optimal 
network as one passes through integer values of average hop number. 

Implicit in our results is a remarkable and counter-intuitive fact. Consider 
first the A = case (no transfer cost). Some length of network is required to 
make the network connected, and one might guess that a non-negligible extra 
length of total network would be required to ensure that average route-length 
between two cities was only (1 + o(l)) times average Euclidean distance. But as 
studied in [2J, one can achieve this with total network length of only (1 + o(l)) 
times the length of the shortest connected network. Analogously, in the present 
A > setting, although time(G„) depends on both route-length and number 
of transfers, making the route- length be only negligibly longer than Euclidean 
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distance is easy to achieve, so our constraint on time(G n ) is effectively just a 
constraint on the average number of transfers. 



ave. hop 
number 



,7/6 



n 13/10 
_ n 13/10 
n 3/2 

— n 3 / 2 uj n 

n 5/2 



Figure 1. Schematic for order of magnitude of length of shortest network, as a 
function of the "transfer cost" parameter A (horizontal scale) and the average number 
of hops (vertical scale) . This illustrates that our results are insensitive to the value of 
A; it would be interesting to find more refined results capturing the influence of A. 



2 Proofs 

2.1 On the equidistribution hypotheses 

Proofs of lower bounds use the following coupling inequality, which is an imme- 
diate consequence of the definition (section of D Sri (x n ) and the coupling 
interpretation ([3] (A. 1.4)) of variation distance. 

Lemma 7 (coupling inequality) Let x = x™ be an arbitrary configuration. 
Let I be uniform on {1,2, . . . , n} and let U n be a uniform point in [C^n 1 / 2 ] 2 . 
Then we can couple xi and U n such that 

P{d( Xl ,U n ) > s n V2) < D Sn (x). 

Lower bounds use hypotheses of the format 

D Sn (15) 

for (s„) satisfying some bound. The classical equidistribution property 

the empirical distribution of {n -1 / 2 a;™, 1 < i < n} converges weakly 
to the uniform distribution on [0, l] 2 
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is equivalent to the property (| 1 5|) holding for some s n — o(n 1 / 2 ): this is the 
hypothesis in Propositions [3] and [4j Replacing one sequence (s n ) by a slower- 
growing sequence makes (I15|) a stronger assumption, so the hypotheses in The- 
orems Q] and O are stronger. A configuration of independent uniform random 
points satisfies (with probability — > 1) the requirement (|15p for any specified se- 
quence s n —* oo, however slowly. So for the random model our results establish 
the correct order of magnitude in the settings considered. 

2.2 Proof of Theorem Q] 

We first prove parts (a) and (b). Consider an arbitrary configuration x n . As 
a route in G 3_hex (x n , L n ) between i and j, use the 3-hop route i — > v(i) — > 
— ► j where is the vertex of T(L, n) closest to vertex i. For this route n 

time(7r) < 2A + n~ 1/2 (d(i,j) + <ic 3 L n ). 

So for any choice of L n = o(n 1/ ' 2 ) we get 

time(G 3 - h0X (x™, L n )) < 2A + dist(x") + o(l). 

This implies the time-bound ([5]) in the worst case, and (using ^) establishes 
([3]) in the random model. 

To calculate network length, note there are N n ~ n/L n hexagonal cells 
intersecting the square of area n. So the total length of edges linking cell centers 
is asymptotic to 

(^ B ) x ~ \(nlLlf x c^l\ 

In the random model, the mean distance from a vertex to the center of its 
hexagon is ciL ni so (the effect of boundary hexagons being asymptotically neg- 
ligible) the mean total length of such edges is ~ C2L n n. Similarly in the worst 
case, the total length of such edges is bounded by c^L n n. Combining: in the 
random model 

Elen(G 3 - hex (X",L„)) ~ 2~ 1 c l L^ A n b ' 2 + c 2 L n n. 
Minimizing the right side over L„, the minimum is attained by 

L n = (2 Cl /c 2 ) 1 /5 n 3/io 

and the minimized value is 

/V 3 / 10 ; (3:=b-2-^c\ r ° c y\ 

This gives @, and the worst-case bound ^ is similar. 

The proof of (c) is based on the following lemma. Fix a network G n on ver- 
tices x n , and let D Sn = D Sn (x n ) be as in section lTTTl Let / and J be independent 
uniform random vertices from {1, 2, . . . , n}. Consider the minimum-time route 
between I and J. Write p n (a, b) for the probability that this route has exactly 
3 hops, the middle hop having length > b and the other hops having lengths 
< a. 
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Lemma 8 

(n h\ < on 2^( a + SnV /2) 4 len(G») 

p n (a,b) < 2D Sn H — (16) 

n z b 

, 7r(2a + 6 + 2 3 / 2 s„) 2 4 len(G„) m/ N 

1-P„(a,6) < 2D Sn + K ^ + L^Z + p( ff?i > 4 ) (17) 

7i on 

where H n is the number of hops on the route from I to J. 

Proof. By the coupling inequality we can couple Xi and xj to independent 
uniform points U\, U 2 in [0, n 1 / 2 ] 2 so that 

P(d{x l7 U-i) > s n V2 or d(xj, U 2 ) > s„V2) < 2D Sn . 

From the definition of p n (a, b), 

p n (a,b) < 2D Sn +P(3e = {v u v 2 ) € £& : d(Z7 r ,fir) < a + s„V^,r = 1,2) 

where £ f, denotes the set of edges of G n with length > b, regarding each such 
edge as two directed edges. For a particular such directed edge {yi, V2), 



2\ 2 



P(d(U r , v r )<a + s n V2, r = 1, 2) < f + anV ^ 



and so 

Pn (aM<2D Sn + \8 b \x ^ a+ ^ 2 A 2 . 

But 6|fb| < 21en(G„) and we get (HHJ). For the other bound, write li for the 
length of the longest edge at Xi . Then 

. E£i T-li 21en(G„) , N 

P(ti >a)< — = ^s-i < 18 

a na na 

Now observe that if a pair of vertices (Xi , Xj ) satisfies none of the following three 
conditions 

(i) d{xi,Xj) < 2a + b 

(ii) max(£i,£j) > a 

(iii) number of hops on route from xi to Xj is > 4 

then (2^,2;^) satisfies the criteria in the definition of p n (a,b). So 

l-p n {a,b) < F(d(x I ,x J ) < 2a + b) +P(max(e I ,£j) > a)+P(H n > 4). 

Applying the previous coupling construction to the first term on the right side, 
and applying (fT8"|) to the second term, 

l-p n (a.b) < 2D Sn +¥(d(U 1 ,U 2 ) < 2a + b + 2s n V2) + 41en ( G?n - > + w(H n > 4). 
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By conditioning on Ui, 

¥(d{U ll U 2 ) <r) <Trr 2 /n (19) 

giving (JT7J). 

Returning to the proof of Theorem HJc), recall that we have a sequence of 
configurations x" such that D Sn — > for some s n = 0(n 3 / 10 ). Apply Lemma[5] 
with a n = n 3 / 10 and b n = Sn 1 / 2 and add the two inequalities; a brief calculation 
gives 

1 < (f + 4)n- 13 / 10 len(G„) + nS 2 + ¥(H n > 4) + o(l) 
for a constant C. Now 

timc(G„) - dist(x n ) > A(EH n - 1) (20) 

and so by hypothesis (O EH n < 3 + o(l) and so P(if n > 4) is bounded away 
from 1 (and the same conclusion follows from the weaker assumption ©). So 
by choosing S sufficiently small, 

n- 13/10 len(G„) = n(l) 

as required. 

2.3 Proof of Proposition [3] 

(a) Consider as before the vertices T(L, n) associated with the scaled triangular 
lattice. Define a network G n to consist of all edges (xi,v) where Xi £ x" and 
v £ T(L,n). For each pair of vertices consider the shortest two-edge route; 
it is easy to check that this route length is at most d(i,j) + 2C3L, and so 

time(G n ) - dist(x") < A + 2c 3 L/n 1/2 . 

The number of edges of G n is 0(n x n/L 2 ) and their lengths are Olri 1 / 2 ), so 

len(G„) = 0(n 5 / 2 /L 2 ). 

Taking L n ~ Sn 1 / 2 for small S establishes (a). 

(b) Fix r > 0. As before, write £i for the length of the longest edge at a?f. 
For each pair of vertices (xi,Xj) at least one of the following three conditions 
must hold. 

(i) max(4,y > Irn 1 / 2 

(ii) d(xi,Xj) < rn 1 ! 2 

(iii) the route from xi to Xj has at least three hops. 
So for random vertices Xi, xj 

2P(fj > \rn 1/2 ) + P(d{ Xl ,Xj) < rn 1 ! 2 ) + > 3) > 1 

where H n is the number of hops on the route from xj to xj. Using l|18p the 
first term is at most 8 ^if^ ^ ■ Using (|T9|) and the equidistribution hypothesis, 

¥(d(xi,xj) < rn 1 ' 2 ) < irr 2 + o(l). (21) 
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So 

^^>l-P(ff„>3)^r 2 - o( l). 

Using (j20|) as before, hypothesis implies that F(H n > 3) is bounded away 
from 1. So for sufficiently small r 

. SlenfAO 
hm mf ^— — > 

n TTl ' 

establishing (b). 



2.4 Proof of Proposition [4] 

Part (a) follows from Proposition EJa) and general convergence arguments. For 
(b) we use the following lemma, proved by a simple compactness agument. 

Lemma 9 For each k > 1 and p > there exists ry(/c, p) > swc/i that the 
following holds. Let U be uniform on the unit square. For each collection of 
points (x;vx,V2, . . . , Vk) in the unit square, and each function < f(u) < 1 with 

E/(C0 > P> 



E 



min (d(x, Vi ) + d( Vi , U)) - d(x, U) f(U) 

Ki<k 



Set 5 = j-k^ 1 / 2 and let k > 1. Consider a network G n on vertex-set x". Write 
h(i,j) for the number of hops on the minimum-time route from Xi to Xj. Define 
V as the set of i such that at most k edges at Xi have lengths > Sn 1 / 2 , and for 
i € V let Vi be the set of the other end-vertices of such edges. Define B to be 
the set of pairs such that h(i,j) = 2 and the first hop on the route from i 
to j has length > Sn 1 / 2 . 

Now consider a pair i, j such that i € V and € -B. Let (xi, v,Xj) be the 
route from i to j. So w g Vi. Using the triangle inequalities 

d(v,u) < d(v,Xj) + d{xj,u) 
d(xi,Xj) < d(xi , u) + d(xj , u) 

we find that, for any point u, 

d(xi, v) + d(v, u) — d(xi, u) < d(xi, v) + d(v, Xj) — d(xi, Xj) + 2d(xj, u). 
Because time(i, j) = n^ 1 ^ 2 (d(x i , v) + d(v, Xj)) + A we get 

n~ 1/2 mm (d(x u v) + d(v, u) - d{x u u)) l (i£ y u j)eB) 

< time(i, j) - A - n^ 1/2 d(x il x ] ) + 2n~ 1/2 d(xj,u). 
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Apply this inequality to uniform random I, J and the coupled pair (xj,U n ) 
given by Lemma [7] Taking expectations, 



A n := n- 1 / 2 

< time(G„) - A - dist(x") + 2 n - 1/2 Ed( Xj , U n ) 



min (d(xj,v) + d(v, U n ) - d(xi, U n )) l(/ e V,(z J)es) 



and the right side — * as n — > oo by hypothesis l|14p and by the coupling 
inequality. But by conditioning on I and applying Lemma [51 

A n > Er)(k,g(I)) where g(i) := P((t, J) G B) ■ l (ieV) . 

As n — > oo, the fact A n — > implies Eg (I) — > 0, that is 

P((J, J) G B, I G V) as n -> oo. (22) 

On the other hand, by symmetry (I, J) = (J, I) and the fact that one hop of 
a two- hop route covering distance > 2Sn 1 ^ 2 must have length > Sri 1 / 2 , 

p((J, J) G fl J) - 2,d(x/,a:j) > 25n 1 ' 2 ,i G V,j 6V)>J. 



So 



'((/, J) G S,t G V) > §P (/i(7, J) = 2,d( Xl ,xj) > 2Sn 1/2 ,ie V,j G v) 



Now hypothesis (HI]) implies Eh(I, J) < 2 + o(l). We must have ¥(h(I, J) = 
1) — ► because otherwise some non- vanishing proportion of all (2) possible 
edges would appear in G n , making len(G„) be of order n 5 / 2 . So 

¥(h{I, J) =2)-f 1. 

Using this and 

P ((I, J) G B, i G V) > § - 4tt<5 2 - o(l) - 2P(7 g V). 

We chose 5 to make AttS 2 = i, and now (122)) implies 

P(/ V) > I - o(l). 

The definition of V gives the first inequality in 

len(G„) > \h Sn 1 ' 2 \V C \ 

= |jfe <Sn 3/2 P(7 g V) 

> Jfc„3/2 ( 1 _ o(1)) . 



Because A; is arbitrary, we have proved n 3//2 len(G„) — * 00 
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2.5 Proof of Theorem [5] 

(a) Let x* be the center of the square [0, n 1 / 2 ] 2 . It is enough to construct 
networks such that the maximum hop-distance from any vertex to x* is < 
(1 + o(l)) log 2 logn. We do this via a hierarchical construction of hubs. Define 

h„ = log 2 log n 

and then take b n such that b n /h n — > oo and iSS^L _> o. Define 

s(n,0) = n 1/2 

m(n,j) = |V S K.7 - 1)/VI 
s(n,j) = s(n,j-l)/m(n,j),l<j<J n 

for 

J„ := min{j : s(n,j) < b n }. 

Then define 

s(n,j) = |s(n, j - 1), J„ + 1 < j < K n 

for 

K n := min{j : s(n,j) < 1}. 

Let iSj be the natural partition of [0, n 1 / 2 ] 2 into subsquares a of side s(n,j), and 
write x a for the center of the subsquare a. Given an arbitrary configuration x™, 
define the edges of G n to be 

(i) for each 1 < j < K n and each a G Sj, the edge (x a ,Xa), where a S Sj-\ 
denotes the supersquare containing a. 

(ii) for each x 6 x™, the edge (x, x CT ), where cr e 5^ is the subsquare containing 
x in the finest partition. 

So the natural route from x to x* uses 1 + K n hops. Ignoring the "integer part" 
in the definition of m(n,j) (it is easy to check this makes no difference to the 
asymptotics below), 

s(n,j) = s(n,j - l)/m(n, j) = y/s(n,j - l)h n 

which is 

log s(n,j) = ±logs(n,j - 1) + \ \ogh n 

and so 

log s(n,J n ) < 2~ Jn \ogs(n, 0) +logh n . 

If Jn > h n then the right side would be 0(log h n ), contradicting the definition of 
J„, and so we must have J„ < h n ultimately. Noting that K n — J n < 1 + log 2 b n 
we have shown K n < (1 + o(l)) log 2 logn, establishing the desired bound on 
hop- distance. 

To bound the network length, we have 

len(G„) < y/2 + V , " ., x V2s(n,j - 1) 
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and so 



len(G„) ^ < ^s(nj-l) 



iV2 



jr[ s 2 (n,j) 



E 



m (n, j) 



A„ 

E 



^s(n,j-l) 3 .J^ +1 s(n,j) 
< J n //i„ + 8 



using the definition of m(n,j) for the first term and the fact s(n, K n ) > 1/2 for 
the second term. Since J„ < /i n ultimately, we have proved len(G„) = 0(n). 

(b) Fix n and a network G n on vertex-set x™. From the definition of hop(G„ ) 
as an average, there exists some distinguished vertex v* such that 

ave^ hop(i,v*) < hop(G n ). 

Remark. The analysis below largely follows the idea in (a); a minimal-hop 
network for distributing material from a "center" v* to other vertices must be 
somewhat like the hierarchical scheme in (a) . In particular it uses longest edges 
in the first step. Of course, in the setting of hop(G„) this cannot be true for 
every initial vertex v, and this is where the argument is inefficient. 

Consider an array of integers 

m(n,j)>4; < j < J n 
which will be defined precisely later. Use this array to define 

s(n, 0) = n 1 / 2 /m(n, 0) 
and inductively for 1 < j < J n 

s ( n ;i) = s ( n ; j ~ l)/ m ( n T j)- 

Let Sj be the natural partition of [0, n 1 / 2 ] 2 into subsquares a of side s(n,j). We 
will bound (Lemma [10] below) hop-distance from v* in terms of the following 
construction which we envisage as an infection process. In this process, we define 
subsets Xj C Sj of squares a be be infected (either "primary" or "secondary" 
or both) according to the following rules. 

(i) Tq consists of the square a* € Sq containing the distinguished vertex v* . 
Now for a € Sj write a G Sj—i for the supersquare containing a. For 1 < j < 

(ii) a G Sj is infected as a secondary infection if it or a square of Sj_i adjacent 
to a is infected; 

(iii) a G Sj is infected as a primary infection if there exists an edge of length 
> s(n,j — 1) with one end- vertex in a. 

(In (ii), "adjacent" includes diagonally adjacent). Write Xj for the set of infected 
squares of Sj . 



15 



Lemma 10 Define IZj C Sj to be the set of squares a such that there exists a 
route from v* to some vertex x G a with at most j hops. Then IZj C Ij . 

Proof. IZo = To = {c*}, so suppose inductively that IZj-i C Ij-i- Let 
a G IZj, and consider the first edge e = (x,v) on a route from some x to v* 
with at most j hops. In the case len(e) > s(n,j — 1), then a is infected by (iii). 
Write a' G Sj-i for the square containing v* . So a' G C Ij-i is infected, 

and in the alternate case len(e) < s(n,j — 1) we have that a is the same as or 
adjacent to a', so o is infected by (ii). 



Figure 2. Upper bounding the spread of infection from a primary infection of 
square . For visual clarity we draw the case m(n,j) = 3. 

Now for any infected square of Sk, the infection can be traced back to some 
primary infection of some square of Sj for some < j < k. Conversely, each 
primary infection of some cr G Sj creates an epidemic of infection in smaller 
squares. It is geometrically clear (see Figure 2) that the total area of the squares 
thus infected in Sj n is bounded by C\ area(a^) for some constant C\. 
Writing A n for the total area of infected squares of S.j n , we see 

J n 

A n < C\ ^ s 2 (n,j) x \{<j G Sj : a has primary infection }|. 

Because a primary infection in Sj is caused by an edge of length > s(n, j — 1), 
for j > 1 

| {cr (z Sj : a has primary infection }| < 



So 



s(n,j - 1) 



A n < C.^— • -( i leu: ( i h : V 



m 2 (n,0) j^[s{n,j-l) 
^ + 2 Cl len(G„)V S( ' !J '- 1) 



m 2 (n, 0) " ^ m 2 (n,j) 
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Now suppose, for some 8 < 1/2, 

A„ < 8 and D. (ni j n) (x n ) < 6. (23) 

Then at least n(l — 28) vertices of x™ are not in infected squares of Sj n) and so 
by Lemma [TU1 these vertices are at hop-distance at least J„ + l from v* , implying 

hop(G„) > (1-2<5)(J„ + 1). (24) 

Now fix0<a<l-(3 and a constant B. Define 

J„ = a log 2 log n 

m(n,0) = \BC{\ 
m( 



So 



implying 



i(n,j) = \yjh n s(n,j - l)] . 

s(n, 7 — 1) 1 
v ,J L < — s j > 1 

m 2 (n,j) h n 



^1<J_ + 2C len ( G ") J " - 1 2(71 len ( G ") 
n ~ B 2 n h n B 2 B n ' 



Ignoring the "next integer" in the definition of m(n,j) (it is easy to check this 
makes no difference to the asymptotics below), 

s(n,j) = s(n,j - l)/m(n,j) = \J s(n,j - l)/h n 

which is 

log s(n,j) = | log s(n,j - 1) - ±log/i„ 

and so 

logs(n, J n ) > 2T Jn logs(n, 0) -log/i„. 
Using the definition of J n 

log 2 log s(n,J n ) > (l-a-o(l))log 2 logn. 

Now the sequence (s n ) in the hypothesis of (b) has 

log 2 log s n < (0 + o(l)) log 2 log n 

and so by choice of a we have s(n, J n )/s n — > oo. By hypothesis D Sri (x" ) — > 
and so under "greater smoothing" we have also 

£> s( „, Jn) (x n )^0. (25) 

Because len(G„) = 0(n), for given S > we can choose B such that n _1 A n < 6 
ultimately. Now the assumptions ([23]) are satisfied, and ([24]) implies hop(G„) > 
(1 — 25){J n + 1). Since 8 is arbitrary and we can choose a arbitrarily close to 
1 — (3 we have established (b) . 
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3 Final remarks 



3.1 Geometry of near-optimal networks 

In the setting of Theorem [1] (3-hop networks) we speculate that under equidis- 
tribution assumptions there is a certain "rigidity" property, that every near- 
optimal network has the same general features - a highly-connected "core" of 
about n 2 / 5 hubs. On the other hand, in the setting of 2-hop networks (Proposi- 
tion 3]) our construction is clearly not optimal in detail (e.g. because it provides 
two possible routes i — > j, via a hub near i or a hub near j, so one could con- 
struct more efficient networks containing only one of these alternatives). So in 
this setting we speculate that different near-optimal networks can have different 
statistical properties. 



3.2 Average m-hop networks 

Theorem [1] and Proposition [5] suggest that, if we constrain the average number 
of hops to be some integer m > 2, the minimum network length will scale as 
n °( m ) (possibly with an "w„ — > oo arbitrarily slowly" term) for some exponent 
9(m), where we know 

9(2) = §, 0(3) = i§. 

The following recursive behavior for m > 4 seems intuitively clear, though we 
will not attempt to give details. The optimal network has hubs at a certain 
density n~ J ( m \ Each point of x n is connected to its nearest hub. The hubs are 
connected as in the optimal networks for average m — 2 hops. By calculating 
the length of such a network and optimizing over "/(m) we get a recursion 

30(m-2)-l 
6{m) = 29(m-2) 

whose solution is 

9{2m) = 1+2^2, 0(2m + 1) = 1 + 

so that 

9(4) = f, 9(5) = §, 9(6) = ±§, 9(7) = % 



Note also that the formula gives 9(1) = |, corresponding to the length 0(n 5 ^ 2 ) 
of the compete graph on all n vertices. 



3.3 Hierarchical networks over random points 

Hierarchical networks over random points have been studied in other contexts. 
For instance Q3] emphasize the use of standard point process methods for 
finding exact formulas within such constructions built over Voronoi cells. 
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